Speedups from Executing Critical Software Segments to Coarse-Grain Reconfigurable Logic
نویسندگان
چکیده
In this paper, we propose a hardware/software partitioning method for improving applications’ performance in embedded systems. Critical software parts are accelerated on hardware of a single-chip generic system comprised by an embedded processor and coarse-grain reconfigurable hardware. The reconfigurable hardware is realized by a 2-Dimensional array of Processing Elements. A list-based mapping algorithm has been developed for estimating the execution cycles of kernels on Coarse-Grain Reconfigurable Arrays. The proposed partitioning flow has been largely automated for a program description in C language. Extensive hardware/software experiments on five real-life applications are presented. The results illustrate that by mapping critical code on coarse-grain reconfigurable hardware, speedups ranging from 1.2 to 3.7, with an average value of 2.2, are achieved. Key-Words: Reconfigurable embedded systems, Coarse-grain reconfigurable array, performance improvement, partitioning, kernel identification, mapping.
منابع مشابه
Speedups from extending embedded processors with a high-performance coarse-grained reconfigurable data-path
In this paper, an embedded system that extends microprocessor cores with a high-performance coarsegrained reconfigurable data-path is introduced. The data-path have been previously introduced by the authors. It is composed by computational resources able to realize complex operations which aid in improving the performance of time critical application parts, called kernels. A compilation flow is...
متن کاملPerformance Improvements of DSP Applications on a Generic Reconfigurable Platform
Speedups from mapping four real-life DSP applications on an embedded system-on-chip that couples coarsegrained reconfigurable logic with an instruction-set processor are presented. The reconfigurable logic is realized by a 2-Dimensional Array of Processing Elements. A design flow for improving application’s performance is proposed. Critical software parts, called kernels, are accelerated on the...
متن کاملDataflow based Near-Data Processing using Coarse Grain Reconfigurable Logic
The emergence of 3D-DRAM has rekindled interest in near-data processing research. This article introduces and describes a method for near-data processing using hybrid dataflow techniques implemented with coarse grain reconfigurable logic. We provide an initial evaluation of the concept to justify further development. The melding of these technologies produces increased throughput while dramatic...
متن کاملA Partitioning Flow for Accelerating Applications in Processor-FPGA Systems
This paper presents a hardware/software partitioning flow for improving performance in systemson-chip comprised by processor and Field Programmable Gate Array. Speedups are achieved by executing critical software parts on the reconfigurable FPGA logic. A generic hybrid system architecture is considered by the methodology. The partitioning flow uses an automated analysis process at the basic-blo...
متن کاملPerformance Improvements of Microprocessor Platforms with a Coarse-Grained Reconfigurable Data-Path
This paper presents the performance improvements by coupling a high-performance coarse-grained reconfigurable data-path with a microprocessor in a generic platform. It is composed by computational units able to realize complex operations which aid in improving the performance of time critical application parts, called kernels. A design flow is proposed for mapping software descriptions to the m...
متن کامل